Han Li

Southern University of Science and Technology

MMG2Skill: Can Agents Distill In-the-Wild Guides into Self-Evolving Skills?

Jun 01, 2026
Via arXiv

VCap: Hypergeometric Rewards for Weak-to-Strong Visual Captioning

May 27, 2026
Via arXiv

Counteraction-Aware Multi-Teacher On-Policy Distillation for General Capability Recovery with Domain Preservation

May 26, 2026
Via arXiv

MetaphorVU: Towards Metaphorical Video Understanding

May 25, 2026
Via arXiv

TerminalWorld: Benchmarking Agents on Real-World Terminal Tasks

May 21, 2026
Via arXiv

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

May 19, 2026
Via arXiv

SimGym: A Framework for A/B Test Simulation in E-Commerce with Traffic-Grounded VLM Agents

May 19, 2026
Via arXiv

Entropy Polarity in Reinforcement Fine-Tuning: Direction, Asymmetry, and Control

May 14, 2026
Via arXiv

SimPersona: Learning Discrete Buyer Personas from Raw Clickstreams for Grounded E-Commerce Agents

May 14, 2026
Via arXiv

PDEAgent-Bench: A Multi-Metric, Multi-Library Benchmark for PDE Solver Generation

May 10, 2026
Via arXiv